Reliable likelihood ratios for statistical model-based voice activity detector with low false-alarm rate
نویسندگان
چکیده
The role of the statistical model-based voice activity detector (SMVAD) is to detect speech regions from input signals using the statistical models of noise and noisy speech. The decision rule of SMVAD is based on the likelihood ratio test (LRT). The LRT-based decision rule may cause detection errors because of statistical properties of noise and speech signals. In this article, we first analyze the reasons why the detection errors occur and then propose two modified decision rules using reliable likelihood ratios (LRs). We also propose an effective weighting scheme considering spectral characteristics of noise and speech signals. In the experiments proposed in this study, with almost no additional computations, the proposed methods show significant performance improvement in various noise conditions. Experimental results also show that the proposed weighting scheme provides additional performance improvement over the two proposed SMVADs.
منابع مشابه
Adaptive Signal Detection in Auto-Regressive Interference with Gaussian Spectrum
A detector for the case of a radar target with known Doppler and unknown complex amplitude in complex Gaussian noise with unknown parameters has been derived. The detector assumes that the noise is an Auto-Regressive (AR) process with Gaussian autocorrelation function which is a suitable model for ground clutter in most scenarios involving airborne radars. The detector estimates the unknown...
متن کاملSelection of Reliable Likelihood Ratios for Statistical Model-Based Voice Activity Detection
A statistical model-based voice activity detection (VAD) is a robust algorithm in noisy condition to detect speech region from input signal by speech and non-speech statistical model such as complex Gaussian probability density function (PDF). The decision rule used in this VAD is based on Bayes’ rule and considers likelihood ratios (LRs) in whole frequency region. In this VAD, however, the Bay...
متن کاملA Bayesian approach to voice activity detection using multiple statistical models and discriminative training
In this study, the problem of voice activity detection (VAD) is formulated in a Bayesian hypothesis testing framework. Unlike traditional VAD schemes that employ a single statistical model, multiple models are assumed to be potentially engaged with a priori probabilities, due to the statical diversity of the environmental noise degrading the speech. Moreover, the optimal a priori probabilities ...
متن کاملDual-microphone Voice Activity Detection Incorporating Gaussian Mixture Models with an Error Correction Scheme in Non-stationary Noise Environments
In this paper, a voice activity detection (VAD) method is proposed based on Gaussian mixture models (GMMs) by exploiting the spatial selectivity in dual-microphone environments. In other words, each GMM is constructed according to the direction-ofarrival (DOA) to detect speech intervals. Based on the assumption that the target speech is located in front of dual-microphones, the VAD is performed...
متن کاملPhysiology-Invariant Meal Detection for Type 1 Diabetes.
BACKGROUND Fully automated artificial pancreas systems require meal detectors to supplement blood glucose level regulation, where false meal detections can cause unnecessary insulin delivery with potentially fatal consequences, and missed detections may cause the patient to experience extreme hyperglycemia. Most existing meal detectors monitor various measures of glucose rate-of-change to detec...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- EURASIP J. Adv. Sig. Proc.
دوره 2011 شماره
صفحات -
تاریخ انتشار 2011